Optimum bit allocation and decomposition for high quality audio coding
Authors
Abstract
Current audio compression schemes can reduce the per-channel bit rate of high-quality audio signals from 16 bits per sample to around 2-4 bits per sample. These schemes exploit knowledge of psychoacoustics and use a uniform or nonuniform frequency decomposition. In this paper we derive the bit allocation that achieves the highest perceptual quality at a fixed bit rate for an arbitrarily decomposed, critically sampled filter bank. The resulting optimum bit allocation produces a shaped reconstruction noise floor approximately parallel to the masking threshold. A perceptual coding gain is defined, and the optimum filter bank decomposition is the one that maximizes it. Optimum band splitting is discussed, and it is pointed out that splitting along critical bands does not lead to optimal performance.
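The claim that the optimum allocation yields a noise floor parallel to the masking threshold can be illustrated with a small numerical sketch. This is not the paper's derivation: it assumes equal-width bands, the textbook high-resolution model in which quantization noise in a band is proportional to the band variance times 2^(-2b), and no non-negativity or integer constraint on the bit counts. The function names and the example variances and thresholds are hypothetical.

```python
import math

def allocate_bits(variances, thresholds, mean_bits):
    """Assumed model: noise_k ~ variance_k * 2**(-2 * b_k), equal-width bands.

    Minimising the summed noise-to-mask ratio under a fixed mean bit budget
    gives b_k = mean_bits + 0.5*log2(v_k/t_k) - average of that log term.
    Bits may come out fractional or negative; a real coder would clamp
    and round them (not handled in this sketch).
    """
    half_log = [0.5 * math.log2(v / t) for v, t in zip(variances, thresholds)]
    avg = sum(half_log) / len(half_log)
    return [mean_bits + h - avg for h in half_log]

def noise_to_mask_db(variances, thresholds, bits):
    """Noise-to-mask ratio per band in dB under the same assumed noise model."""
    return [
        10.0 * math.log10(v * 2.0 ** (-2.0 * b) / t)
        for v, t, b in zip(variances, thresholds, bits)
    ]

# Hypothetical band variances and masking thresholds (linear power units).
variances = [100.0, 25.0, 4.0, 1.0]
thresholds = [1.0, 0.5, 0.5, 0.25]

bits = allocate_bits(variances, thresholds, mean_bits=3.0)
nmr = noise_to_mask_db(variances, thresholds, bits)

for k, (b, r) in enumerate(zip(bits, nmr)):
    print(f"band {k}: bits = {b:.3f}, noise-to-mask = {r:.2f} dB")
```

Under these assumptions every band ends up with the same noise-to-mask ratio, so on a dB scale the noise floor sits at a constant offset from the masking threshold, which is the "parallel" behaviour the abstract describes.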
Similar resources
A Perceptually Based Embedded Subband Speech Coder - Speech and Audio Processing, IEEE Transactions on
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented. An infinite impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition. A perceptual model, computed using subband spectral analysis, optimizes the coder’s perceptual quality. Dynamic bit allocation a...
Subband Coding of Digital Audio Signals
A subband coding method for high-quality digital audio signals is described. To achieve low bit rates at a high quality level, it exploits the simultaneous masking effect of the human ear. It is shown how this effect can be used in an adaptive bit allocation scheme. The method is capable of reducing the bit rate of a compact disk signal by a factor of seven. Results obtained with a low-complexi...
High Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic
This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet packet scheme to achieve perceptually transparent compression of high quality (44.1 kHz) audio signals at about 45 kbit/s. The filter bank structure adapts according to psychoacoustic criteria and according to the computational complexity that is available at the decoder. This permits software imple...
An Efficient, Low-Complexity Audio Coder Delivering
This paper proposes an efficient, low-complexity audio coder based on the SPIHT (set partitioning in hierarchical trees) coding algorithm [5], which has achieved notable success in still image coding. A wavelet packet transform is used to decompose the audio signal into 29 frequency subbands corresponding roughly to the critical subbands of the human auditory system. A psychoacoustic model, which, ...
A New Criterion and Associated Bit Allocation Method for Current Audio Coding Standards
This paper presents a new noise-shaping criterion. Based on the new criterion, we derive an efficient bit allocation method. The method is applicable to current audio standards such as MPEG-1 Layer 3 and MPEG-4 AAC. It achieves a speed-up of more than ten times and better quality than the traditional two-nested-loop method presented in ISO draft...